Bacterial small-molecule three-hybrid system

ABSTRACT

A transgenic bacterial cell comprising (a) a dimeric small molecule which comprises a first moiety known to bind a first receptor domain covalently linked to a second moiety known to bind a second receptor domain; (b) nucleotide sequences which upon transcription encode i) a first fusion protein comprising the first receptor domain, and ii) a second fusion protein comprising the second receptor domain; and (c) a reporter gene wherein expression of the reporter gene is conditioned on the proximity of the first fusion protein to the second fusion protein. The cell is also adapted for use in a method for identifying a molecule that binds to a known target in a bacterial cell from a pool of candidate molecules, and a method for identifying an unknown target receptor to which a molecule is capable of binding in a bacterial cell. Also described are compounds and kits for carrying out the methods.

This application claims priority of U.S. Ser. No. 10/132,039, filed Apr.24, 2002, the contents of which are herein incorporated by referenceinto this application.

Throughout this application, various publications are referenced byArabic numerals in parentheses. Full citations for these publicationsmay be found at the end of the specification immediately preceding theclaims. The disclosures of these publications in their entireties arehereby incorporated by reference into this application in order to morefully describe the state of the art as known to those skilled therein asof the date of the invention described and claimed herein.

This invention has been made with government support under NationalInstitutes of Health grant GM62867. Accordingly, the U.S. Government hascertain rights in the invention.

FIELD OF INVENTION

This invention relates to the field of identifying protein targets andtheir corresponding small-molecule drugs and other biomolecules usingthe techniques of chemically induced dimerization (“CID”).

BACKGROUND OF THE INVENTION

Affinity chromatography has long been used to identify the proteintargets of small-molecule drugs and other biomolecules. While anessential tool for biochemical research, affinity chromatography canoften be labor intensive and time consuming. Recently the yeastthree-hybrid assay, a derivative of the two-hybrid assay, was introducedas a straightforward, in vivo alternative to affinity chromatography(1,2). The yeast two-hybrid system relies on the interaction of twofusion proteins to bring about the transcriptional activation of areporter gene thus identifying protein-protein interactions in an invivo system (2). The subsequently developed yeast three-hybrid systemscreens for a small molecule-protein interaction based on the principlethat small ligand-receptor interactions underlie many fundamentalprocesses in biology and form the basis for pharmacological interventionof human diseases in medicine (3). In the three-hybrid assay,protein-small-molecule interactions are detected as reconstitution of atranscriptional activator (“TA”) and subsequent transcription of areporter gene (4-7). A dimeric small-molecule ligand bridges theDNA-binding domain (“DBD”) of the TA, which is fused to the receptor forone ligand, and the activation domain (“AD”) of the TA, which is fusedto the receptor for the other ligand. For affinity chromatographyapplications, one ligand-receptor pair is used as an anchor and theother is the small-molecule-protein interaction being investigated.While the yeast three-hybrid assay is quite powerful, a bacterialequivalent would increase the number of proteins that could be tested byseveral orders of magnitude because the transformation efficiency anddoubling time of E. coli is significantly greater than that of S.cerevisiae. In addition, there may be applications where it isadvantageous to test a eukaryotic protein in a prokaryotic environmentwhere many pathways are not conserved.

However, the yeast three-hybrid assay cannot be transferred directly tobacteria. The components of the transcription machinery and themechanism of transcriptional activation differ significantly betweenbacteria and yeast. Ligand-receptor pairs often are organism specificbecause of cell permeability, toxicity, or other interactions with thecellular milieu. Bacterial two-hybrid assays have only begun to bedeveloped in the past few years (8) and to date only initial effortstoward the design of a robust bacterial three-hybrid system have beenreported (9, 10). Described below is the first robust small-moleculebacterial three-hybrid system.

SUMMARY OF THE INVENTION

This invention provides a transgenic bacterial cell comprising

-   -   (a) a dimeric small molecule which comprises a first moiety        known to bind a first receptor domain covalently linked to a        second moiety capable of binding a second receptor domain,        wherein the first and second moieties are different;    -   (b) nucleotide sequences which upon transcription encode        -   i) a first fusion protein comprising the first receptor            domain, and        -   ii) a second fusion protein comprising the second receptor            domain; and    -   (c) a reporter gene wherein expression of the reporter gene is        conditioned on the proximity of the first fusion protein to the        second fusion protein.

This invention also provides a method for identifying a molecule thatbinds a known target receptor in a bacterial cell from a pool ofcandidate molecules, comprising:

-   -   (a) forming a dimeric molecule by covalently bonding each        molecule in the pool of candidate molecules to a ligand capable        of selectively binding to a receptor;    -   (b) introducing the dimeric molecule into a bacterial cell        culture comprising bacterial cells that express a first fusion        protein which comprises the known target receptor domain against        which the candidate molecule is screened, a second fusion        protein which comprises the receptor domain to which the ligand        selectively binds, and a reporter gene wherein expression of the        reporter gene is conditioned on the proximity of the first        fusion protein to the second fusion protein;    -   (c) permitting the dimeric molecule to bind to the first fusion        protein and to the second fusion protein, bringing the two        fusion proteins into proximity so as to activate the expression        of the reporter gene;    -   (d) selecting the bacterial cell that expresses the reporter        gene; and    -   (e) identifying the small molecule that binds the known target        receptor.

This invention further provides a method for identifying an unknowntarget receptor to which a known molecule is capable of binding in abacterial cell, comprising:

-   -   (a) providing a dimeric molecule having a first ligand which has        a specificity for the unknown target receptor covalently bonded        to a second ligand capable of selectively binding to a receptor;    -   (b) introducing the dimeric molecule into a bacterial cell which        expresses a first fusion protein which comprises the unknown        target receptor domain, a second fusion protein which comprises        the receptor domain to which the second ligand selectively        binds, and a reporter gene wherein expression of the reporter        gene is conditioned on the proximity of the first fusion protein        to the second fusion protein;    -   (c) permitting the dimeric molecule to bind to the first fusion        protein and to the second fusion protein so as to activate the        expression of the reporter gene;    -   (d) selecting which bacterial cell expresses the unknown target        receptor; and    -   (e) identifying the unknown target receptor.

This invention also provides a transgenic bacterial cell comprising

-   -   (a) a dimeric small molecule which comprises a methotrexate        moiety covalently linked to a moiety capable of binding a        receptor domain;    -   (b) nucleotide sequences which upon transcription encode        -   i) a first fusion protein comprising a DHFR domain and a            first fragment of an enzyme, and        -   ii) a second fusion protein comprising the receptor domain            and a second fragment of the enzyme, wherein activity of the            enzyme is conditioned on the proximity of the first fragment            of the enzyme to the second fragment of the enzyme.

This invention further provides a method for identifying a molecule thatbinds a known target receptor in a bacterial cell from a pool ofcandidate molecules, comprising:

-   -   (a) forming a dimeric molecule by covalently bonding each        molecule in the pool of candidate molecules to a methotrexate        moiety;    -   (b) introducing the dimeric molecule into a bacterial cell        culture comprising bacterial cells that express a first fusion        protein which comprises the known target receptor domain against        which the candidate molecule is screened, and a first fragment        of an enzyme, and a second fusion protein which comprises a DHFR        receptor domain and a second fragment of the enzyme;    -   (c) permitting the dimeric molecule to bind to the first fusion        protein and to the second fusion protein, bringing the first        fragment and the second fragment of the enzyme in to proximity        so as to reconstitute the activity of the enzyme;    -   (d) selecting the bacterial cell that exhibits the activity of        the enzyme; and    -   (e) identifying the small molecule that binds the known target        receptor.

This invention also provides a method for identifying an unknown targetreceptor to which a known molecule is capable of binding in a bacterialcell, comprising:

-   -   (a) providing a dimeric molecule having a first ligand which has        a specificity for the unknown target receptor covalently bonded        to a methotrexate moiety;    -   (b) introducing the dimeric molecule into a bacterial cell which        expresses a first fusion protein which comprises the unknown        target receptor domain, and a first fragment of an enzyme, and a        second fusion protein which comprises a DHFR receptor domain and        a second fragment of the enzyme;    -   (c) permitting the dimeric molecule to bind to the first fusion        protein and to the second fusion protein, bringing the first        fragment and the second fragment of the enzyme in to proximity        so as to reconstitute the activity of the enzyme;    -   (d) selecting the bacterial cell that exhibits the activity of        the enzyme; and    -   (e) identifying the unknown target receptor.

Also provided by this invention is are kits for carrying out themethods.

DESCRIPTION OF THE FIGURES

FIG. 1. Bacterial RNA polymerase two- and three-hybrid systems. (A) Doveand Hochschild built a bacterial two-hybrid system based on theirobservation that λcI-mediated recruitment of RNA polymerase issufficient to activate gene transcription. The DNA operator of λcI isplaced upstream of the RNA polymerase binding site for a lacZ reportergene. Using the known interaction between the proteins, Gal4 andGal11^(P), as a proof of principle, they showed that co-expression ofλcI-Gal4 and αNTD-Gal11^(P) fusion proteins was sufficient to activatetranscription of the lacZ gene. λcI binds to its operator and throughthe binding of Gal4 and Gal11^(P) is effectively associated with αNTD.αNTD is the N-terminal domain of the α-subunit of RNA polymerase and isused to localize the entire RNA polymerase machinery to the RNApolymerase binding site and thus promote transcription of the lacZ gene.(B) In our bacterial three-hybrid system, the interaction between λcIand αNTD is provided by a heterodimeric small molecule, Mtx-SLF. Mtx-SLFbridges between the fusion proteins, λcI-FKBP12 (the receptor for SLF)and αNTD-DHFR (the receptor for Mtx), activating transcription of thelacZ reporter gene.

FIG. 2. Retrosynthetic analysis of Mtx-SLF.

FIG. 3. β-Galactosidase assays establish that transcriptional activationis small-molecule dependent in the bacterial three-hybrid system. X-Galindicator plates are shown for the different constructs used to verifythe three-hybrid system. Each column corresponds to strain V674E bearingplasmids expressing λcI and αNTD fusion proteins involved in the RNApolymerase two- and three-hybrid assays: 1, λcI-Gal4, αNTD-Gal11^(P); 2,λcI, αNTD; 3, λcI-FKBP12, αNTD; 4, λcI-FKBP12, αTD-DHFR; 5, λcI,αNTD-DHFR. 1 is the Gal4-Gal11^(P) direct protein-protein interactionused as a positive control. 2, 3, and 5 lack both of the necessaryreceptor proteins to test the necessity of all three components fortranscriptional activation. Plate A contains no Mtx-SLF; and plate B, 10μM Mtx-SLF. The plates are LB solid media containing 40 mg/mL X-Gal, 0.5mM IPTG, 0.5 mM tPEG, 100 mg/mL ampicillin, and 6 mg/mL chloramphenicol.

FIG. 4. The levels of small-molecule induced transcriptional activationwere quantified using liquid lacZ assays. The strains here correspondexactly to those in FIG. 2 and are used in liquid ONPG assays where thelevels of transcriptional activation can be quantitated based on theamount of reporter protein, b-galactosidase, which is produced. Thestrains are V674E bearing plasmids encoding λcI and αNTD fusionproteins: 1, λcI-Gal4, αNTD-Gal11^(P); 2, λcI, αNTD; 3, λcI-FKBP12,αNTD; 4, λcI-FKBP12, αNTD-DHFR; 5, λcI, αNTD-DHFR. The strains wereassayed in triplicate from three transformants and standard deviationsare shown. The strains are grown in LB with 0.5 mM IPTG, 100 μg/mLampicillin, 6 μg/mL chloramphenicol, and small molecules at theindicated concentration. 1 is the Gal4-Gal11^(P) direct protein-proteininteraction Mtx-SLF independent positive control. Only 4 contains bothreceptor proteins for Mtx-SLF as the other strains lack either one orboth of the receptor proteins. The last two small-moleculeconcentrations are competition assays in which an excess of one of theligands for the receptor proteins was used to compete out the positivesignal due to the three-hybrid system.

FIG. 5. The levels of transcriptional activation depend on Mtx-SLFconcentration in the bacterial three-hybrid system. The concentrationsof Mtx-SLF in the media were varied, and the levels of lacZtranscription were quantitated in liquid culture using ONPG. The strainsare V674E expressing the following λcI and αNTD fusion proteins: (♦),λcI-Gal4, αNTD-Gal11^(P) (a direct protein-protein interaction); (▬),λcI-FKBP12, αNTD (a negative control); and (▴), λcI-FKBP12, αNTD-DHFR(the three-hybrid system). The rate of ONPG hydrolysis was measured intriplicate from three different transformants after growth in LB mediacontaining 0.5 mM IPTG, 100 μg/mL ampicillin, 6 μg/mL chloramphenicol,and Mtx-SLF at the indicated concentrations. The standard deviation foreach data point is also shown.

DETAILED DESCRIPTION OF THE INVENTION

This invention provides a transgenic bacterial cell comprising

-   -   (a) a dimeric small molecule which comprises a first moiety        known to bind a first receptor domain covalently linked to a        second moiety capable of binding a second receptor domain,        wherein the first and second moieties are different;    -   (b) nucleotide sequences which upon transcription encode        -   i) a first fusion protein comprising the first receptor            domain, and        -   ii) a second fusion protein comprising the second receptor            domain; and    -   (c) a reporter gene wherein expression of the reporter gene is        conditioned on the proximity of the first fusion protein to the        second fusion protein.

The dimeric small molecule may have the formula:H1-Y-H2wherein each of H1 and H2 may be the same or different and capable ofbinding to a receptor which is the same or different with a IC₅₀ of lessthan 100 μM; and Y is a linker which may be present or absent. Each ofH1 and H2 can be capable of binding to a receptor with a IC₅₀ of lessthan 10 μM; or with a IC₅₀ of less than 1 μM; or with a IC₅₀ of lessthan 100 nM; or with a IC₅₀ of less than 10 nM; or with a IC₅₀ of lessthan 1 nM.

Each of H1 and H2 may be a methotrexate moiety, FK506 moiety, an FK506analog, a tetracycline moiety, or a cephem moiety. In a preferredembodiment, H1 or H2 is a methotrexate moiety. In another preferredembodiment, H1 or H2 is an FK506 analog. The FK506 analog may have thestructure:

In a specific embodiment, the dimeric small molecule may have thestructure:

-   -   wherein n is an integer from 1 to 20, preferably n is an integer        from 2 to 12, more preferably n is an integer from 3 to 9, in a        specific embodiment, n is 8.

The first fusion protein can further comprise a DNA binding domain, andthe second fusion protein can further comprises a transcriptionactivation domain. Alternatively, the first fusion protein can furthercomprises a transcription activation domain, and the second fusionprotein can further comprises a DNA binding domain.

The transcription activation domain can be αNTD. The DNA-binding domaincan be λcI, AraC, LexA, Gal4, or zinc fingers.

The first or the second receptor domain can be that of dihydrofolatereductase (“DHFR”), glucocorticoid receptor, FKBP12, FKBP mutants,tetracycline repressor, or a penicillin binding protein. The DHFR can bethe E. coli DHFR (“eDHFR”).

In a specific embodiment, the first fusion protein is DHFR-λcI orFKBP12-λcI; and the second fusion protein is DHFR-αNTD or FKBP12-αNTD.

The reporter gene can be Lac Z, araBAD, aada (spectinomycin resistance),his3, β-lactamase, GFP, luciferase, TetR (tetracyclin resistance), KanR(kanamycin resistance), or Cm (chloramphenicol resistance). In aspecific embodiment, the reporter gene is Lac Z.

This invention also provides a transgenic bacterial cell comprising

-   -   (a) a dimeric small molecule which comprises a methotrexate        moiety covalently linked to a moiety capable of binding a        receptor domain;    -   (b) nucleotide sequences which upon transcription encode        -   i) a first fusion protein comprising a DHFR domain, and        -   ii) a second fusion protein comprising the receptor domain;            and    -   (c) a reporter gene wherein expression of the reporter gene is        conditioned on the proximity of the first fusion protein to the        second fusion protein.

The elements of this cell are as defined previously.

This invention also provides a method for identifying a molecule thatbinds a known target receptor in a bacterial cell from a pool ofcandidate molecules, comprising:

-   -   (a) forming a dimeric molecule by covalently bonding each        molecule in the pool of candidate molecules to a ligand capable        of selectively binding to a receptor;    -   (b) introducing the dimeric molecule into a bacterial cell        culture comprising bacterial cells that express a first fusion        protein which comprises the known target receptor domain against        which the candidate molecule is screened, a second fusion        protein which comprises the receptor domain to which the ligand        selectively binds, and a reporter gene wherein expression of the        reporter gene is conditioned on the proximity of the first        fusion protein to the second fusion protein;    -   (c) permitting the dimeric molecule to bind to the first fusion        protein and to the second fusion protein, bringing the two        fusion proteins into proximity so as to activate the expression        of the reporter gene;    -   (d) selecting the bacterial cell that expresses the reporter        gene; and    -   (e) identifying the small molecule that binds the known target        receptor.

In the method, the steps (b)-(e) of the method can be iterativelyrepeated in the presence of a preparation of random small molecules forcompetitive binding with the screening molecule so as to identify amolecule capable of competitively binding the known target receptor.

The dimeric molecule in the method can be obtained from a combinatoriallibrary. The dimeric molecule in the method can comprise a ligandcapable of selectively binding to a receptor with a IC₅₀ of less than100 μM; or with a IC₅₀ of less than 10 μM; or with a IC₅₀ of less than 1μM; or with a IC₅₀ of less than 100 nM; or with a IC₅₀ of less than 10nM; or with a IC₅₀ of less than 1 nM.

The dimeric molecule in the method can comprises a methotrexate moiety,FK506 moiety, an FK506 analog, a tetracycline moiety, or a cephemmoiety. In a specific embodiment, the dimeric molecule comprises amethotrexate moiety. In another specific embodiment, the dimericmolecule comprises an FK506 analog.

The FK506 analog can have has the structure:

The first fusion protein in the method can further comprise a DNAbinding domain, and the second fusion protein further comprises atranscription activation domain. Alternatively, the first fusion proteinin the method can further comprise a transcription activation domain,and the second fusion protein further comprises a DNA binding domain.

The transcription activation domain can be αNTD.

The DNA-binding domain can be λcI, AraC, LexA, Gal4, or zinc fingers.

The first or the second fusion protein in the method can comprise areceptor domain of dihydrofolate reductase (“DHFR”), glucocorticoidreceptor, FKBP12, FKBP mutants, tetracycline repressor, or a penicillinbinding protein. The DHFR can be the E. coli DHFR (“eDHFR”).

In the method, the first fusion protein can be DHFR-λcI or FKBP12-λcI.The second fusion protein can be DHFR-αNTD or FKBP12-αNTD.

The reporter gene can be Lac Z, araBAD, aada, his3, β-lactamase, GFP,luciferase, TetR, KanR, Cm. In a specific embodiment, the reporter geneis Lac Z.

This invention also provides a method for identifying an unknown targetreceptor to which a known molecule is capable of binding in a bacterialcell, comprising:

-   -   (a) providing a dimeric molecule having a first ligand which has        a specificity for the unknown target receptor covalently bonded        to a second ligand capable of selectively binding to a receptor;    -   (b) introducing the dimeric molecule into a bacterial cell which        expresses a first fusion protein which comprises the unknown        target receptor domain, a second fusion protein which comprises        the receptor domain to which the second ligand selectively        binds, and a reporter gene wherein expression of the reporter        gene is conditioned on the proximity of the first fusion protein        to the second fusion protein;    -   (c) permitting the dimeric molecule to bind to the first fusion        protein and to the second fusion protein so as to activate the        expression of the reporter gene;    -   (d) selecting which bacterial cell expresses the unknown target        receptor; and    -   (e) identifying the unknown target receptor.

The unknown target receptor is encoded by a DNA from the groupconsisting of genomicDNA, cDNA and syntheticDNA.

The steps (b)-(e) of the method can be iteratively repeated so as toidentify the unkown target receptor.

The other elements of this method are as defined previously.

This invention also provides a kit for identifying a molecule that bindsto a known target in a bacterial cell from a pool of candidatemolecules, comprising

-   -   (a) a host bacterial cell containing a reporter gene that is        expressed only when bound to a DNA-binding domain and when in        the proximity of a transcription activation domain;    -   (b) a first vector containing a promoter that functions in the        host bacterial cell and a DNA encoding a DNA-binding domain;    -   (c) a second vector containing a promoter that functions in the        host bacterial cell and a DNA encoding a transcription        activation domain;    -   (d) a dimeric small molecule which comprises a moiety that binds        to a known receptor domain covalently linked to a candidate        molecule;    -   (e) a means for inserting into the first vector or the second        vector a DNA encoding the known receptor domain in such a manner        that the known receptor domain and an expression product of the        first or second vector are expressed as a fusion protein;    -   (f) a means for inserting into the first vector or the second        vector a DNA encoding the known target receptor in such a manner        that the known target receptor and an expression product of the        first or second vector are expressed as a fusion protein; and    -   (g) a means for transfecting the host cell with the first        vector, and the second vector, wherein binding of the dimeric        small molecule to the known target receptor results in a        measurably greater expression of the reporter gene then in the        absence of such binding.

The elements of this kit are as defined previously.

This invention further provides a kit for identifying an unknown targetreceptor to which a molecule is capable of binding in a bacterial cell,comprising

-   -   (a) a host bacterial cell containing a reporter gene that is        expressed only when bound to a DNA-binding domain and when in        the proximity of a transcription activation domain;    -   (b) a first vector containing a promoter that functions in the        host bacterial cell and a DNA encoding a DNA-binding domain;    -   (c) a second vector containing a promoter that functions in the        host bacterial cell and a DNA encoding a transcription        activation domain;    -   (d) a dimeric small molecule which comprises a moiety that binds        a known receptor domain covalently linked to a moiety against        which the unknown target is to be screened for binding;    -   (e) a means for inserting into the first vector or the second        vector a DNA encoding the known receptor domain in such a manner        that the known receptor domain and an expression product of the        first or second vector are expressed as a fusion protein;    -   (f) a means for inserting into the first vector or the second        vector a DNA encoding the unknown target receptor in such a        manner that the unknown target receptor and an expression        product of the first or second vector are expressed as a fusion        protein; and    -   (g) a means for transfecting the host cell with the first        vector, and the second vector, wherein binding of the dimeric        small molecule to the unknown target receptor results in a        measurably greater expression of the reporter gene then in the        absence of such binding.

The elements of this kit are as defined previously.

This invention yet further provides a process for screening a chemicallibrary for a molecule that binds a known target receptor, comprisingproviding a chemical library, providing a bacterial cell that expressesthe known target receptor as a part of a fusion protein, and identifyingthe molecule that binds the known target receptor in the bacterial cellby the method described above. As a result, this invention also providesa molecule identified by the method, wherein the molecule was notpreviously known to bind the known target receptor.

This invention yet further provides a process for screening a cDNAlibrary for a nucleic acid that encodes a receptor to which a knownmolecule binds, comprising providing a cDNA library, providing a dimericmolecule having two ligands, of which one ligand is the known molecule,and identifying the nucleic acid that encodes the receptor to which theknown molecule binds by the method described above. As a result, thisinvention also provides an isolated nucleic acid identified by theprocess, wherein the isolated nucleic acid was not previously known toencode a receptor to which the known molecule binds. The isolatednucleic acid can encode an enzyme or a portion thereof.

This invention also provides a transgenic bacterial cell comprising

-   -   (a) a dimeric small molecule which comprises a methotrexate        moiety covalently linked to a moiety capable of binding a        receptor domain;    -   (b) nucleotide sequences which upon transcription encode        -   i) a first fusion protein comprising a DHFR domain and a            first fragment of an enzyme, and        -   ii) a second fusion protein comprising the receptor domain            and a second fragment of the enzyme, wherein activity of the            enzyme is conditioned on the proximity of the first fragment            of the enzyme to the second fragment of the enzyme.

The elements of the cell are as described above.

This invention also provides a method for identifying a molecule thatbinds a known target receptor in a bacterial cell from a pool ofcandidate molecules, comprising:

-   -   (a) forming a dimeric molecule by covalently bonding each        molecule in the pool of candidate molecules to a methotrexate        moiety;    -   (b) introducing the dimeric molecule into a bacterial cell        culture comprising bacterial cells that express a first fusion        protein which comprises the known target receptor domain against        which the candidate molecule is screened, and a first fragment        of an enzyme, and a second fusion protein which comprises a DHFR        receptor domain and a second fragment of the enzyme;    -   (c) permitting the dimeric molecule to bind to the first fusion        protein and to the second fusion protein, bringing the first        fragment and the second fragment of the enzyme in to proximity        so as to reconstitute the activity of the enzyme;    -   (d) selecting the bacterial cell that exhibits the activity of        the enzyme; and    -   (e) identifying the small molecule that binds the known target        receptor.

The elements of this method are as defined previously.

This invention also provides a method for identifying an unknown targetreceptor to which a known molecule is capable of binding in a bacterialcell, comprising:

-   -   (a) providing a dimeric molecule having a first ligand which has        a specificity for the unknown target receptor covalently bonded        to a methotrexate moiety;    -   (b) introducing the dimeric molecule into a bacterial cell which        expresses a first fusion protein which comprises the unknown        target receptor domain, and a first fragment of an enzyme, and a        second fusion protein which comprises a DHFR receptor domain and        a second fragment of the enzyme;    -   (c) permitting the dimeric molecule to bind to the first fusion        protein and to the second fusion protein, bringing the first        fragment and the second fragment of the enzyme in to proximity        so as to reconstitute the activity of the enzyme;    -   (d) selecting the bacterial cell that exhibits the activity of        the enzyme; and    -   (e) identifying the unknown target receptor.

The elements of this method are as defined previously.

Each of the ligand halves of the dimeric small molecule may be derivedfrom a compound selected from the group consisting of steroids,hormones, nuclear receptor ligands, cofactors, antibiotics, sugars,enzyme inhibitors, and drugs.

Each of the ligand halves of the dimeric small molecule may alsorepresent a compound selected from the group consisting of3,5,3′-triiodothyronine, trans-retinoic acid, biotin, coumermycin,tetracycline, lactose, methotrexate, FK506, and FK506 analogs.

In the described methods, the screening is performed by FluorescenceAssociated Cell Sorting (FACS), or gene transcription markers selectedfrom the group consisting of Green Fluorescence Protein,LacZ-β-galagctosidases, luciferase, antibiotic resistant b-lactamases,and yeast markers.

The known moiety that binds a known receptor domain can be aMethotrexate moiety or an analog thereof. The known receptor domain canbe dihydrofolate reductase (“DHFR”) generally, or the E. coli DHFR(“eDHFR”). Alternatively, the pairing can be FK506/FKBP12, AP series ofsynthetic FK506 analogs/FKBPs, tetracycline/tetracycline repressor,cephem/penicillin binding protein. The penicillin binding domain can befrom Streptomyces R61.

Each of the methods is readily adapted to identify which substrate froma pool of candidate substrates is selected in a cell by a known enzymefor a bond forming reaction between the substrate and a known aminoacid.

The described methods, cell and kit may also be adapted to identify newprotein targets for pharmaceuticals.

The described methods, cell and kit may also be adapted for determiningthe function of a protein, further including screening with a naturalcofactor being part of the CID.

The described methods, cell and kit may also be adapted for determiningthe function of a protein, further including screening with a naturalsubstrate being part of the CID.

The described methods, cell and kit may also be adapted for screening acompound for the ability to inhibit a ligand-receptor interaction.

The foregoing embodiments of the subject invention may be accomplishedaccording to the guidance which follows. Certain of the foregoingembodiments are exemplified. Sufficient guidance is provided for askilled artisan to arrive at all of the embodiments of the subjectinvention.

Preparation and Design of Ligand Halves of the Dimeric Small Molecule

A ligand half should bind its receptor with high affinity (≦100 nM),cross cell membranes yet be inert to modification or degradation, beavailable in reasonable quantities, and present a convenient side-chainfor routine chemical derivatization that does not disrupt receptorbinding.

A commercial source of traditional, non-covalent dimeric molecules foruse in a chemically induced dimerization system is ARIAD(www.ariad.com), who call their CID “ARGENT TECHNOLOGY.” The compoundsdescribed herein as well as other commercial compounds can bederivatized for use in the bacterial three-hybrid system.

For example, methotrexate (“Mtx”) is an attractive ligand half (alsoreferred to as “chemical handle”). Mtx inhibition of dihydrofolatereductase (DHFR) is one of the textbook examples of high-affinity ligandbinding. The interaction between Mtx and DHFR is extremely wellcharacterized in the literature both biochemically and structurally.DHFR is a monomeric protein and binds Mtx with picomolar affinity (11).Even though Mtx inhibits DHFR with such high affinity, E. coli grows inthe presence of Mtx when supplemented with appropriate nutrients (12).The ability of Mtx to serve both as an antibacterial and an anticanceragent is clear evidence that Mtx has excellent pharmacokineticproperties. Mtx is known to be imported into cells via a specific folatetransporter protein. Mtx is commercially available and can besynthesized readily from simple precursors. Mtx can be modifiedselectively at its g-carboxylate without disrupting its interaction withDHFR (11, 13). Mtx is commercially available and can be modifiedselectively at its á-carboxylate without disrupting dihydrofolatereductase (DHFR) binding (11, 13). Even though Mtx inhibits DHFR with pMaffinity (12), both E. coli and S. cerevisiae grow in the presence ofMtx when supplemented with appropriate nutrients (13).

However, not all ligand halves, also referred to as chemical handles,can be readily used in a given organism without the need to modify theorganism. For instance, dexamethasone requires several heat-shockproteins (“HSPs”), in order to be bound by the glucocorticoid receptor(“GR”). The required HSPs occur in yeast and other eukaryotes; however,since bacteria lack these HSPs and thus GR fails to bind Dex, Dex basedsmall molecules would not readily work in a bacterial three-hybridsystem. Similarly, using Mtx in bacteria requires special adaptation ofthe bacterial cell as Mtx can actually act as an antibiotic, asdiscussed in more detail in the examples. Mtx is exported from thebacterial cell by the TolC-dependent multi-drug efflux pump (32).Knocking out the tolC gene allows Mtx-based molecules to not only enterthe bacterial cell but remain there as well. Furthermore, knocking outthyA removed the toxic effects associated with Mtx by preventing thebio-accumulation of the toxic dihydrofolate substrate of DHFR (32).However, as discussed in detail in the example, knocking out the thyA,e.g through a mutation, is not necessary at the concentrations of Mtxthat is used in the described bacterial three-hybrid system. While touse Mtx in the exemplified system, one must only knock out the tolCgene/protein in order to assure that Mtx-based molecules remain in thecell, it is clearly possible to also knock out thyA for higherconcentrations of Mtx.

Other ligand halves may be, for example, steroids; enzyme inhibitors,such as Methotrexate used herein; drugs, such as FK506; hormones, suchas the thyroid hormone 3,5,3′-triiodothyronine; ligands for nuclearreceptors may be retinoic acids; general cofactors, such as Biotin; andantibiotics, such as Coumermycin (which can be used to induce proteindimerization according to Perlmutter et al., Nature 383, 178 (1996)).

One, or both, of the ligand halves may also be a moiety that is asubstrate for an enzyme, i.e. a moiety that would not on its own bind toa protein, but would require an enzyme to assist bonding to a protein.In this way, the system can be made dependent on an enzyme, and wouldonly operate when an enzyme is present. Such an enzyme would form acovalent bond between the small non-peptide molecule and a protein. Thiswould provide flexibility in the system by allowing for one of thenon-covalent interactions to be replaced by a covalent interaction.

Similarly, one, or both, of the ligand halves may be a moiety thatspontaneously seeks out and forms a covalent bond with a receptor. Anexample of this is the interaction between Fluorouracil and ThymidylateSynthase, and another between Cephen and the Penicillin Binding Protein.This would also provide flexibility in the system by allowing for one ofthe non-covalent interactions to be replaced by a covalent interaction.

Linkage of the Ligand Halves in the Dimeric Small Molecule

While the ligand halves can be simply linked by a covalent bond betweenthe two of them, more elaborate linkages may also be used depending onthe screen to be performed. The linkage may be formed by any of themethods known in the art (14, 15). Descriptions of linkage chemistriesare also provided by WO 94/18317, WO 95/02684, WO 96/13613, WO96/06097,and WO 01/53355, these references being incorporated herein byreference. The linkers are all commerciaily available or can be preparedin a single step. The linkers vary in hydrophobicity, length, andflexibility.

The linker may be designed to respond to enzymatic activity. Forexample, a linker can contain a glycosidase bond, which may be cleavedby a glycosidase enzyme and formed by a Glycosyltransferase enzyme; oran amide bond, which may be cleaved by a protease and formed bypeptidase or transpeptidase; or an aldol product bond, which is cleavedby a retro-aldolase and formed by aldolase; or an ester bond; or aphosphodiester bond. Such bonds can be used in .bacterial based screenssimilarly to their use in yeast based screens, which are described in WO01/53355.

The enzymes that act on such linkers may be known enzymes or novelproteins which are being screened for specific enzymatic activity. Novelenzymes can be evolved using combinatorial techniques.

Once a desired substrate is selected and formed into the dimeric smallmolecule, a large number of enzymes and derivatives of enzymes can bescreened. A variety of enzymes and enzymes classes are listed on theWorld Wide Web beginning at prowl.rockefeller.edu/enzymes/enzymes.htm.All enzymes are given an Enzyme Commission (E.C.) number allowing it tobe uniquely identified. E.C. numbers have four fields separated byperiods, “a.b.c.d”. The left-hand-most field represents the most broadclassification for the enzyme. The next field represents a finerdivision of that broad category. The third field adds more detailedinformation and the fourth field defines the specific enzyme. Thus, inthe “a” field the classifications are oxidoreductases, transferases,hydrolases, lyases, isomerases, and ligases. Each of these “a”classifications are then further separated into corresponding “b”classification, each of which in turn is separated into corresponding“c” classifications, which are then further separated into corresponding“d” classes.

Moreover, new enzymes are discovered and are intended to be includedwithin the scope of this invention, which is itself designed to evolveor discover such new enzymes.

Design of the Protein Chimeras

The second important feature is the design of the protein chimeras. Anumber of chimeras are discussed in detail in WO 01/53355.

The described bacterial three-hybrid system is based on the RNApolymerase two-hybrid system reported by Dove and Hochschild in 1997(16). A variety of methods for detecting protein-protein interactions inbacteria are now available. (8-10, 16-19). Generally, these methods arebased either on enzyme complementation or transcriptional activation orrepression assays. While the enzyme complementation assays areessentially the same as those used in eukaryotes, entirely newtranscription-based assays had to be developed for bacteria because thecomponents of the transcription machinery are poorly conserved betweeneukaryotes and prokaryotes. The choice to adapt the RNA polymerase assaydeveloped by Dove and Hochschild was because transcriptional activationin this assay results in a large increase in reporter gene transcriptionand because reconstitution of transcriptional activation was expected tobe largely conformation independent. Based on their studies of themechanism of transcriptional activation by λ-repressor (λcI) (20), Doveand Hochschild developed an in vivo assay for protein-proteininteractions based on dimerization of λcI and the N-terminal domain ofthe α-subunit of RNA polymerase (αNTD). Specifically, they designed areporter construct consisting of a λcI operator upstream of the TATA boxfor the lacZ gene, which encodes β-galactosidase. They then preparedplasmids encoding a Gal4-λcI fusion protein and a Gal11^(P)-αNTD fusionprotein. Gal4 and Gal11^(P) are two proteins known to interact with highaffinity and so provided a well-established test case. Finally, theinteraction between Gal4-λcI and Gal11^(P)-αNTD was shown to activatetranscription of the lacZ reporter gene (21). This originalGal4-Gal11^(P) two-hybrid system is used as a small-molecule independentpositive control in the example provided below.

Design of Reporter Genes

A reporter gene assay measures the activity of a gene's promoter. Ittakes advantage of molecular biology techniques, which allow one to putheterologous genes under the control of a bacterial cell. Activation ofthe promoter induces the reporter gene as well as or instead of theendogenous gene. By design, the reporter gene codes for a protein thatcan easily be detected and measured. Commonly it is an enzyme thatconverts a commercially available substrate into a product. Thisconversion is conveniently followed by either chromatography or directoptical measurement and allows for the quantification of the amount ofenzyme produced.

Reporter genes are commercially available on a variety of plasmids forthe study of gene regulation in a large variety of organisms (22).Promoters of interest can be inserted into multiple cloning sitesprovided for this purpose in front of the reporter gene on the plasmid(23, 24). Standard techniques are used to introduce these genes into acell type or whole organism (e.g., as described in Sambrook, J.,Fritsch, E. F. and Maniatis, T. Expression of cloned genes in culturedmammalian cells. In: Molecular Cloning, edited by Nolan, C. New York:Cold Spring Harbor Laboratory Press, 1989). Resistance markers providedon the plasmid can then be used to select for successfully transfectedcells.

Ease of use and the large signal amplification make this techniqueincreasingly popular in the study of gene regulation. Every step in thecascade DNA -->RNA -->Enzyme -->Product -->Signal amplifies the next onein the sequence. The further down in the cascade one measures, the moresignal one obtains.

In an ideal reporter gene assay, the reporter gene under the control ofthe promoter of interest is transfected into cells, either transientlyor stably. Receptor activation leads to a change in enzyme levels viatranscriptional and translational events. The amount of enzyme presentcan be measured via its enzymatic action on a substrate.

Reconstitution of Enzyme Activity

In lieu of the protein chimeras or reporter genes as described above,the reassembly of an enzyme, and thus its activity, can be used as areporter system. Complementation between enzyme fragments has beenobserved in numerous cases since the pioneering discovery ofintracistronic complementation by Ullmann and colleagues in 1967. Theso-called á-complementation between two truncated β-galactosidasepolypeptides became a classical tool in molecular biology for cloningtechniques and has recently found new interest in cell biology as asensitive marker of eukaryotic cell fusions and a reporter.

Potentially, a large number of enzymes might be split into twocomplementary fragments that could reassociate to reconstitute thenative enzymatic activity. Functional complementation between two enzymefragments can be exploited to design a three-hybrid system providedthat: (i) the cognate enzymatic activity is easily detected or selectedfor in vivo; (ii) the two fragments do not reassociate spontaneouslywhen expressed as separate entities (as do, for example, the classicalLacZ-á and á-acceptor); (iii) when fused to interacting polypeptides,the two fragments reassociate in an enzymatically active complex. Avariety of enzyme reassembly systems have been described (37). Forexample, a protein complementation assay based on engineered fragmentsof mouse dihydrofolate reductase (mDHFR) has been described (38).Subsequently, the same group used the mDHFR complementation assay toselect pairs of leucine zippers that selectively heterodimerize.

One advantage of an enzyme reconstitution assay is that it can work incells which do not endogenously express the enzyme, thus providing anefficient reporter system.

This invention will be better understood from the Experimental Detailswhich follow. However, one skilled in the art will readily appreciatethat the specific methods and results discussed are merely illustrativeof the invention as described more fully in the claims which followthereafter.

EXPERIMENTAL DETAILS

Part I—Preparation of Dimeric Small Molecule Ligand

In order to convert the Dove and Hoschild two-hybrid assay into athree-hybrid system, it was initially important to design a dimericligand that could bridge λcI and αNTD via the ligand's receptors. Forthe bridging small molecule, a heterodimer of methotrexate (Mtx) and asynthetic analog of FK506 (SLF) was prepared. This heterodimer isreferred to in this description Mtx-SLF. Mtx-SLF was used to dimerize aλcI-FK506 binding protein 12 (λcI-FKBP12) protein chimera and anαNTD-dihydrofolate reductase (αNTD-DHFR) protein chimera as shown inFIG. 1. Methotrexate (Mtx) inhibits dihydrofolate reductase (DHFR) witha low picomolar K_(I), and the interaction between the two has beenextensively studied (11, 25). In addition, our laboratory recentlyshowed Mtx could be used successfully in a yeast three-hybrid system (7,26). For the other half of the bridging small molecule, we used SLF,available from Ariad Pharmaceuticals as an FK506 analog. SLF hasnanomolar affinity for FKBP12, and the interaction between the two hasbeen fully characterized (27, 28). In addition, SLF homodimers have beenused previously in several mammalian three-hybrid systems (29).

The retrosynthetic analysis of Mtx-SLF is shown in FIG. 2. The synthesisis based on previous syntheses of Mtx and SLF derivatives and wasdesigned to allow Mtx, SLF, or the linker between them to be variedreadily. The Mtx portion of the molecule begins as the γ-methyl ester ofL-glutamic acid and is based on previous syntheses of Mtx (7, 26, 30).γ-Methyl L-glutamic acid is inexpensive, and the α-carboxylate can beselectively protected as the -butyl ester by transiently protonating theα-amino group (31). The diprotected amino acid is then coupled to4-(methylamino)benzoic acid using standard peptide coupling reagents.Finally, the γ-methyl ester is saponified to yield the free acid forfurther reactions. SLF acid was synthesized as described previously fromL-pipecolinic acid in 59% yield for 6 steps (6, 27). The Mtx and SLFportions were then coupled to 1,10-diaminodecane in a three-componentpeptide coupling reaction. 2,4-Diamino-6-bromomethyl-pteridine is addedafter this coupling reaction in order to simplify purification of thesynthetic intermediates. Finally, acid cleavage of the tert-butyl esteryielded Mtx-SLF. Thus, the Mtx-SLF heterodimer was prepared from twocomponents in 5% overall yield for the 6 steps from the γ-methyl esterof L-glutamic acid or 6% overall yield in 9 steps from the L-pipecolinicacid precursor of SLF.

The synthetic chemistry performed for the preparation of the Mtx-SLF isdescribed below.

Synthetic Chemistry

General Methods. Reagents were obtained from commercial suppliers andwere used without further purification. All reagents for chemicalsynthesis were purchased from Aldrich. Anhydrous N,N-dimethylformamideand anhydrous methylene chloride were from Sure Seal™ bottles purchasedfrom Aldrich. Methotrexate was a generous gift from the National CancerInstitute. Nuclear magnetic resonance (NMR) spectra were recorded on aBruker 500 (500 MHz), a Bruker 400 (400 MHz) or a Bruker 300 (300 MHz)Fourier Transform (FT) NMR spectrometer at the Columbia UniversityChemistry Department NMR Facility. Spectra were determined inmethanol-d₄ at 300 K with the proton or carbon (3.30δ; 49.0δ) as thereference or in chloroform-d at 300K with proton (7.26δ) as thereference. ¹H NMR spectra are tabulated in the following order: chemicalshift calculated with reference to solvent standards based ontetramethylsilane, multiplicity (s, singlet; d, doublet; t, triplet; m,multiplet; br, broad), coupling constant(s) in Hertz, and number ofprotons. ¹³C NMR spectra were determined on the Bruker 300 MHzinstrument and are proton decoupled. Mass spectra (MS) were recorded atthe Columbia University Department of Chemistry mass spectrallaboratory. Fast Atom Bombardment (FAB) high resolution mass spectra(HRMS) were recorded on a JMS-HX110A mass spectrometer. Low resolutionelectron spray ionization mass spectra (LRMS) were recorded on a JMS-LCMate mass spectrometer. Analytical thin layer chromatography (TLC) wasperformed on silica gel (Whatman LHPKF Silica Gel 60 Å) and visualizedby UV light (254 nm) or stained by ninhydrin. All column chromatographywas flash chromatography carried out on silica gel (EM Science SilicaGel 60), and all eluants used are reported in volume:volume ratios. Allmoisture-sensitive reactions were performed under a positive pressure ofnitrogen in flame- or oven-dried glassware. Organic extracts were driedover anhydrous sodium sulfate. Organic solvents were removed in vacuowith a rotary evaporator equipped with a vacuum pump (ca. 1 torr).Products were then left under vacuum (ca. 0.1 torr) overnight beforeanalysis was performed.

Scheme I

Synthesis of 2. (31) The γ-methyl ester of L-glutamic acid (1) (5.02 g,31.0 mmol) was added to a solution of 70% aqueous (aq.) perchloric acid(3.0 mL) in tert-butyl acetate (200 mL). The resulting solution wasstirred at room temperature (rt) for 3 h during which time the aciddissolves completely. The reaction was then judged complete by thinlayer chromatography (TLC) using 10:1 methylene chloride(CH₂Cl₂):methanol (MeOH). The reaction mixture was extracted with 0.5 Naq. HCl (4×, 400 mL). The pH of the combined aqueous layers was adjustedto 8 using saturated aq. sodium carbonate. The aqueous solution was thenextracted with ethyl acetate (EtOAc) (4×, 500 mL). The organic extractswere combined, washed with brine (2×, 300 mL), and dried over anhydroussodium sulfate. Removing the solvent in vacuo gave 2 as a clear oil in65% yield: R_(f)=0.45 in 10:1 CH₂Cl₂:MeOH; ¹H NMR (400 MHz, CD₃OD) δ3.66 (s, 3), 3.33 (t, J=6.5 Hz, 1), 2.41 (m, 2), 1.95 (m, 1), 1.86 (m,1); LRMS, m/z 218.2 (MH⁺), 219.2 (MH₂ ⁺).

Synthesis of 3. (7, 30) Compound 2 (2.19 g, 10.0 mmol),1,3-dicyclohexylcarbodiimide (D 3.09 g, 15.0 mmol),1-hydroxylbenzotriazole hydrate (HOBt, 2.43 g, 18.0 mmol), andN-methyl-para-benzoic acid (1.59 g, 10.5 mmol) were dissolved inanhydrous dimethyl formamide (DMF, 22 mL) under a nitrogen atmosphere.Diisopropylethylamine (DIEA, 0.1 mL, 0.5 mmol) was added to thesolution, and the reaction mixture was stirred overnight (ON) at rt.After 16 hr, a 1:2:1 water:saturated aq. sodium bicarbonate:brinesolution (500 mL) was added to the reaction giving a yellow suspension.This solution was then extracted with EtOAc (4×, 300 mL). The fractionswere combined, washed with brine (2×, 200 mL) and dried over anhydroussodium sulfate. The organic solvent was then removed in vacuo. Theproduct is purified by silica gel column chromatography (2:1 to 1:1hexanes:EtOAc) in 76% yield: R_(f)=0.25 in 1:1 hexanes:EtOAc; ¹H NMR(500 MHz, CD₃OD) δ 7.66 (d, J=7.0 Hz, 2), 6.58 (d, J=7.0 Hz, 2), 4.59(dd, J=9.5, 5.0 Hz, 1), 3.65 (s, 3), 2.79 (s, 3), 2.47 (t, J=7.5 Hz, 2),2.23 (m, 1), 2.05 (m, 1), 1.46 (S, 9); ¹³C NMR (300 MHz, CD₃OD) δ 175.1,173.0, 170.6, 154.6, 130.2, 121.5, 111.9, 82.9, 54.3, 52.3, 31.4, 30.0,28.3, 27.6; LRMS, m/z 351.2 (MH⁺); HRMS, m/z 351.1930 (MH⁺), calculated351.1920.

Synthesis of 4. Compound 3 (500 mg, 1.43 mmol) was dissolved in methanol(20 mL). Lithium hydroxide monohydrate (120 mg, 2.86 mmol) was dissolvedin water. Both solutions were chilled in a 0° C. ice bath. The aqueoussolution was added to the methanol solution all at once. The resultingsolution was stirred at 0° C. for 10 minutes and then allowed to warm tort and stirred for an additional 80 minutes. Solvent is removed in vacuountil only a yellow gel remained with a volume about 1 mL. Water (20 mL)was added to the remaining reaction mixture. The solution was acidifiedto pH=2 with 1 N aq. HCl (9 mL) and was immediately extracted with EtOAc(5×, 25 mL). The organic extracts were combined, washed with brine (2×,20 mL), and dried over anhydrous sodium sulfate. The solvent was removedin vacuo to yield product 4 in 93% yield: R_(f)=0.05 in 1:1EtOAc:hexanes and 0.45 in 5:1 CH₂Cl₂:MeOH; ¹H NMR (400 MHz, CD₃OD) δ7.64 (d, J=7.0 Hz, 2), 6.56 (d, J=7.0 Hz, 2), 4.46 (dd, J=9.0, 5.0 Hz,1), 2.80 (s, 3), 2.44 (t, J=7.5 Hz, 2), 2.23 (m, 1), 2.05 (m, 1), 1.47(s, 9); ¹³C NMR (300 MHz, CD₃OD) δ 178.4, 174.8, 172.3, 156.2, 130.7,123.2, 113.6, 84.6, 57.0, 33.2, 31.8, 30.5, 29.1; LRMS, m/z 337.3 (MH⁺);HRMS, m/z 337.1751 (MH⁺), calculated 337.1763.

Scheme II

Synthesis of 7. (27) Synthesized as reported in quantitative yield togive 7 as a yellow solid: R_(f)=0.40 in 1:1 EtOAc:hexanes; ¹H NMR (300MHz, CDCl₃) δ 9.86 (s, 1), 7.61-7.55 (m, 2), 7.40-7.34 (m, 2), 7.32 (d,J=4.5 Hz, 1), 7.25-7.10 (m, 2), 6.91-6.85 (m, 2), 3.95 (s, 3), 3.93 (s,3); LRMS, m/z 284.9 (MH⁺).

Synthesis of 8.(27) Synthesized as reported in near quantitative yieldbased on mass as a white crystalline solid (NMR revealed a small amountof non-hydrogenated starting material remains and is carried through tothe next step. NMR integration is used to determine relative quantitiesof the two materials.): R_(f)=0.50 in 1:1 EtOAc:hexanes; ¹H NMR (300MHz, CDCl₃); δ 7.55-7.48 (m, 2), 7.35 (t, J=8.0 Hz, 1), 7.25 (t, J=8.5Hz, 1), 7.10 (br d, J=8.0 Hz, 1), 6.91-6.73 (m, 3), 3.90 (s, 3), 3.88(s, 3), 3.24 (t, J=7.5 Hz, 2), 2.99 (t, J=8.0 Hz, 2); LRMS, m/z 287.0(MH⁺).

Synthesis of 9. (27) Synthesized as reported to give 9 as a clear oil inquantitative yield by mass: R_(f)=0.60 in 1:1 EtOAc:hexanes; ¹H NMR (300MHz, CDCl₃) δ 7.59 (br d, 8.0 Hz, 1), 7.49 (m, 1), 7.37 (t, J=8.0 Hz,1), 7.14 (dd, J=8.0, 2.5 Hz, 1), 6.85-6.72 (m, 3), 4.60 (s, 2), 3.89 (s,3), 3.86 (s, 3), 3.27 (t, J=7.1 Hz, 2), 3.04 (t, J=7.5 Hz, 2), 1.50 (s,9).

Synthesis of 10.(27) Synthesized as reported to yield product 10 in 20%yield fully purified. However, a 75% yield of unreacted startingmaterial was also recovered. R_(f)=0.40 in 5:1 CH₂Cl₂:MeOH and 0.40 in1:1 EtOAc:hexanes; ¹H NMR (500 MHz, CD₃OD) δ 7.28 (t, J=8.0 Hz, 1), 6.93(d, J=7.5 Hz, 1), 6.89 (s, 1), 6.82 (d, J=8.0 Hz, 1), 6.80-6.74 (m, 2),6.70 (d, J=8.0 Hz, 1), 4.60-4.53 (m, 3), 3.80 (s, 3), 3.78 (s, 3),2.65-2.51 (m, 2), 2.04-1.89 (m,2), 1.47 (s, 9); LRMS, m/z 403.3 (MH⁺).

Scheme III

Synthesis of 12.(28) Synthesized as reported to give 12 as a whitecrystalline solid in quantitative yield based on NMR integration: ¹H NMR(400 MHz, CD₃OD) δ 4.03 (br d, J=9.5 Hz, 1), 3.83 (s, 3), 3.41 (br d,J=12.0 Hz, 1), 3.04 (br t, J=11.0 Hz, 1), 2.27 (br d, J=11.5 Hz, 1),2.00-1.63 (m, 5).

Synthesis of 13.(28) Synthesized as reported to yield 13 as a clear oilin 86% yield: R_(f)=0.70 in 1:1 EtOAc:hexanes and 0.25 in 1:4EtOAc:hexanes; ¹H NMR (500 MHz, CD₃OD) δ 5.15 (br d, J=5.0 Hz, 0.7),4.62 (br d, J=4.0 Hz, 0.3), 4.34 (br d, J=12.0 Hz, 0.3), 3.89 (s, 2.1),3.84 (s, 0.9), 3.77 (s, 3), 3.57 (br d, J=14.0 Hz, 0.7), 3.35 (m, 0.7),2.91 (br t, J=13.0 Hz, 0.3), 2.35-2.24 (m, 1), 1.83-1.65 (m, 3)1.55-1.35 (m, 2) This product exists as a 2.5:1 mixture of the trans andcis conformations. Further analysis by COSY allows us to make thefollowing assignments of the two peaks for each proton in the structure:5.15 and 4.62, 4.34 and 3.57, 3.89 and 3.84, 3.35 and 2.91; LRMS, m/z230.1 (MH₁ ⁺).

Synthesis of 14.(28) Synthesized as reported to give 14 as a clear oilin 81% yield: R_(f)=0.85 in 1:1 EtOAc:hexanes and 0.50 in 1:4EtOAc:hexanes; ¹H NMR (500 MHz, CD₃OD) δ 5.16 (br d, J=5.5 Hz, 0.8),4.38 (br d, J=11.5 Hz, 0.2), 4.24 (br d, J=5.0 Hz, 0.2), 3.75 (s, 3),3.39 (br d, J=13.0 Hz, 0.8), 3.23 (td, J=13.0, 2.8 Hz, 0.8), 2.91 (br t,J=13.0 Hz, 0.2), 2.30 (br d, J=14.0 Hz, 0.8), 2.22 (br d, J=13.5, 0.2),1.79-1.60 (m, 5) 1.55-1.35 (m, 2), 1.23-1.13 (m, 6), 0.86 (t, J=7.5 Hz,3) This product exists as a 4:1 mixture of the trans and cisconformations. Further analysis by COSY allows us to make the followingassignments of the two peaks for each proton in the structure: 5.16 and4.24, 4.38 and 3.39, 3.23 and 2.91, 2.30 and 2.22; LRMS, m/z 270.2(MH⁺).

Synthesis of 15.(28) Synthesized as reported to give the product 15 as awhite crystalline material in 96% yield: R_(f)=0.05 in 1:1EtOAc:hexanes; ¹H NMR (500 MHz, CDCl₃) δ 5.29 (br d, J=5.5 Hz, 0.8),4.50 (br d, J=14.0 Hz, 0.2), 4.26 (br d, J=4.5 Hz, 0.2), 3.43 (br d,J=13.0 Hz, 0.8), 3.24 (td, J=12.5, 3.5 Hz, 0.8), 2.94 (br td, J=13.0, 3Hz, 0.2), 2.34 (br d, J=12.0 Hz, 0.8), 2.24 (br d, J=13.5, 0.2),1.79-1.60 (m, 5) 1.55-1.35 (m, 2), 1.23-1.13 (m, 6), 0.86 (t, J=7.5 Hz,3) This product exists as a 4:1 mixture of the trans and cisconformations. Further analysis by COSY allows us to make the followingassignments of the two peaks for each proton in the structure: 5.29 and4.24, 4.50 and 3.43, 3.24 and 2.94, 2.34 and 2.24. Compounds 16, 17, 18,and Mtx-SLF all have this same 4:1 conformation pattern and appearnearly the same spectroscopically as compound 15 for these peaks. Forsimplicity's sake the 0.8 integration will be called 1 and the 0.2 peakwill be disregarded in the characterization for the rest of thesecompounds.

Scheme IV

Synthesis of 16.(27) Synthesized as reported to give 16 as a colorlessoil in 88% yield: R_(f)=0.15 in 4:1 hexanes:EtOAc and 0.65 in 1:1EtOAc:hexanes; ¹H NMR (500 MHz, CD₃OD) δ 7.28 (t, J=8.0 Hz, 1), 6.96 (m,1), 6.89 (s, 1), 6.85 (m, 2), 6.78 (s, 1), 6.71 (d, J=8.0 Hz, 1), 5.73(m, 1), 5.21 (br d, J=5.5 Hz, 1), 4.58 (s, 2), 3.80 (s, 3), 3.78 (s, 3),3.38 (br d, J=14.5 Hz, 1), 3.17 (td, J=13.0, 3.0 Hz, 1), 2.65-2.52 (m,2), 2.32 (br d, J=13.0 Hz, 1), 2.30-2.20 (m, 1), 2.05 (p, J=7.0 Hz, 1),1.78-1.58 (m, 5), 1.46 (s, 9), 1.41-1.26 (m, 2), 1.23 (s, 3), 1.21 (s,3), 0.86 (t, J=7.5 Hz, 3) (see note in Compound 15); LRMS, m/z 640.7(MH⁺).

Synthesis of 17. (27) Synthesized as reported to give 17 in quantitativeyield. TLC analysis showed only one product and the acid was usedwithout further purification: R_(f)=0.05 in 1:1 EtOAc:hexanes and 0.35in 10:1 CH₂Cl₂:MeOH.

Scheme V

Synthesis of 18. Compound 4 (42.7 mg, 0.127 mmol), compound 17 (74.1 mg,0.127 mmol), 1,10-diaminodecane (20.7 mg, 0.120 mmol), DCC (130 mg,0.631 mmol), and DMAP (30.0 mg, 0.246 mmol) were dissolved in CH₂Cl₂ (3mL) under a nitrogen atmosphere and stirred at rt overnight. After 16hours, the solvent was removed in vacuo. The product was purified bysilica gel column chromatography (1:4 EtOAc:hexanes to pure EtOAc) togive a white solid in 30% yield: R_(f)=0.65 in EtOAc and 0.60 in 10:1CH₂Cl₂:MeOH; ¹H NMR (500 MHz, CD₃OD) δ 7.64 (d, J=7.0 Hz, 2), 7.28 (td,J=8.0, 3.0 Hz, 1), 6.98 (m, 2), 6.91 (d, J=7.5 Hz, 1), 6.85 (d, J=8.0Hz, 1), 6.76 (s, 1), 6.69 (d, J=8.0 Hz, 1), 6.56 (d, J=7.0 Hz, 2), 5.74(m, 1), 5.21 (br d, J=5.0 Hz, 1), 4.51 (s, 2), 4.46 (m, 1), 3.80 (s, 3),3.78 (s, 3), 3.38 (br d, J=12.5 Hz, 1), 3.23 (t, J=7.0 Hz, 2) 3.17 (td,J=13.0, 3.0 Hz, 1), 3.11 (t, J=7.0 Hz, 2), 2.80 (s, 3), 2.65-2.52 (m,2), 2.37-2.30 (m, 3), 2.30-2.15 (m, 2), 2.10-2.00 (m, 2), 1.78-1.58 (m,4), 1.48 (s, 9), 1.54-1.38 (m, 5), 1.38-1.28 (m, 2), 1.28-1.16 (m, 18),0.86 (t, J=7.5 Hz, 3) (see note in Compound 15).

Synthesis of Mtx-SLF. (7, 26, 30) Compound 18 (40.0 mg, 38.0 μmol) andthe hydrobromide salt of 2,4-diamino-6-bromomethyl pteridine (18 mg, 45μmol) were dissolved in N,N′-dimethyl acetamide (2 mL). The reactionmixture was stirred in a 50° C. oil bath for 12 hours. The intermediateproduct (R_(f)=0.50 in 10:1 CH₂Cl₂:MeOH) was purified by silica gelcolumn chromatography (30:1 to 10:1 CH₂Cl₂:MeOH). The crude product wasdissolved in trifluoroacetic acid (3 mL) at 0° C. for 5 minutes andallowed to warm to rt and stirred at rt for 1 hour. Toluene (3×, 50 mL)was added to the reaction mixture, and all solvent was removed in vacuo.After removal of solvents, Mtx-SLF was purified by preparative thinlayer silica gel chromatography (5:1 CH₂Cl₂:MeOH, 4×) to give a yellowsolid in 33% yield (for two steps): R_(f)=0.15 in 3:1 CH₂Cl₂:MeOH; ¹HNMR (500 MHz, CD₃OD) δ 8.55 (s, 1), 7.73 (d, J=8.0 Hz, 2), 7.28 (t,J=7.5 Hz, 1), 6.98 (m, 2), 6.91 (br d, J=7.5 Hz, 1), 6.85-6.79 (m, 3),6.76 (s, 1), 6.69 (d, J=6.5 Hz, 1), 5.73 (m, 1), 5.21 (br s, 1), 4.75(s, 2), 4.50 (s, 2), 4.46 (m, 1), 3.80 (s, 3), 3.78 (s, 3), 3.38 (br d,J=13.0 Hz, 1), 3.25-3.10 (m, 6), 3.05 (t, J=7.0 Hz, 2), 2.65-2.52 (m,2), 2.37-2.15 (m, 5), 2.10-2.00 (m, 2), 1.78-1.58 (m, 4), 1.50-1.42 (m,3), 1.38-1.32 (m, 3), 1.35-1.25 (m, 4), 1.25-1.16 (m, 15), 0.86 (t,J=7.5 Hz, 3) (see note in Compound 15).

Small molecule concentration calibration. The small molecules weredissolved in DMF to concentrations of 10 mM for Mtx and 12 mM for theMtx-SLF molecule. The concentrations of Mtx and Mtx-SLF were determinedby Beer's law using an extinction coefficient of ε=6700 cm⁻¹M⁻¹(calculated from a known solution of Mtx in DMF) for Mtx-SLF. Solutionsof compound 16 (SLF-OtBu) were prepared on a sufficient scale to mass 16accurately. All small molecules were stored under a nitrogen atmosphereat −80° C. and allowed to come to rt before use.

Part II—Construction of E. coli Strain.

The next step was the construction of the E. coli strain expressing theλcI-FKBP12 and αNTD-DHFR fusion proteins and containing the lacZreporter construct. Plasmids encoding the λcI-FKBP12 and αNTD-DHFRchimeras were prepared from vectors pACλcI32 and PBRαLN using standardmolecular biology techniques (8). The same synthetic lacZ reporter,placOR2-62, as initially reported by the Hochschild lab was used. Thereporter placOR2-62 is maintained in one copy in the chromosome as aprophage and encodes the lacZ gene 62 bp downstream from the λcIoperator (FIG. 1) (16). Based on previous results from Kopytek and Hushowing that tolC⁻ and thyA⁻ mutations improved the viability an dtolerance of E. coli to Mtx-based molecules, we expected export as wellas toxicity of Mtx-SLF to be problematic in E. coli (32). Thus, wemodified the original Hochschild strain KS1 to be tolC⁻ in order todecrease active export of our small molecule. At the low concentrationsof Mtx-SLF required for the three-hybrid experiments, however, Mtx wasnot sufficiently toxic to warrant the thyA⁻ mutation. We introduced thetolC⁻ mutation into KS1 via a Plvir transduction from strain SK037 (32).We call this test strain V674E. Transformation of the plasmids bearingthe various ^(λ)cI and ^(α)NTD fusion proteins into V674E yielded thefinal experimental strains.

The molecular biology performed for the preparation of the strain isdescribed below.

General methods. Restriction enzymes, Vent DNA polymerase and T 4 DNAligase were purchased from New England Biolabs. The dNTPs used in thePolymerase Chain Reation (PCR) were purchased from Pharmacia Biotech.Oligonucleotides were purchased from The Great American Gene Company(www.geneco.com). The bacto-agar, tryptone-peptone and bacto-yeastextract were purchased from DIFCO. Falcon 14 mL culture tubes were usedfor growing bacteria. Corning Costar 96-well plates with V-shaped wellsused for growing the bacteria for solid media plate assays. The phrogused to transfer cells into 96-well plates or onto petri platescontaining agar media was purchased from Dan-Kar Corp. (Wilmington,Mass.). 5-Bromo-4-chloro-3-indolyl-β-D-galactopyranoside (X-gal) for theplate assays was purchased from Diagnostic Chemicals (Oxford, Conn.).o-Nitrophenyl-β-D-galactopyranoside (ONPG) for the liquid assays waspurchased from Sigma. Methotrexate was from the National CancerInstitutes (NCI). All other chemicals were purchased from Aldrich orSigma. All aqueous solutions were made with distilled water preparedfrom a Milli-Q Water Purification System. For Polymerase Chain Reaction(PCR), a MJ Research PTC-200 Pellier Thermal Cycler was employed. Thetransformation of E. coli was carried out by electroporation using aBio-Rad E. coli Pulser. UV/Vis spectra and absorbances were taken usinga Perkin-Elmer Lambda 6 Spectraphotometer. Restriction digests werecarried out as recommended by New England Biolabs. Sequencing of thereceptor domains of all plasmids constructed was performed by Gene Wiz(New York, N.Y.). All other standard molecular biology techniques werecarried out essentially as described (33-35).

The bacterial two-hybrid plasmids, pAC^(λ)cI32, pBR^(α)LN, pACLGF2, andpBR^(α)Gal11^(P), and E. coli strain KS1 were obtained (8, 21). Plvirand E. coli strain SK037 were obtained (32); and plasmid pJG-FKBP12 wasobtained (2). Map of pBRaáN with TEM1 follows.

Construction of λcI-FKBP12 and αNTD-DHFR protein chimeras. The geneencoding dihydrofolate reductase (DHFR) from E. coli was subcloned fromplasmid pMW102eDHFR to pBRαLN (7, 26). A 531 bp NotI to BamHI fragmentwas prepared by polymerase chain reaction (PCR) from pMW102eDHFR (26)using the primers 5′-ACT TCA GGT GCG GCC GCA GGC TCG GGC GGC TCG GGC GGCGGA GTG CAG GTG GAA AC-3′ (5′ primer VWC482, flanked by a NotI site(underlined) and a Gly-Ser-Gly-Gly-Ser-Gly-Gly linker) and 5′-TGT ATCAAC GGA TCC TTA ATG GTG ATG GTG ATG GTG CGA GCC GAA TTC TTC CAG TTT TAGAA-3′ (3′ primer VWC487, flanked by a BamHI site (underlined) and a 6His tag followed by a stop codon). This fragment was inserted betweenthe NotI site and the BamHI site in pBRαLN to give to pBRαLNeDHFR. Theregion generated by PCR was verified by DNA sequencing using primersVWC628, 5′-CTG GCT GAA CAA CTG GAA GC, and VWC629, 5′-ATA TAG GCG CCAGCA ACC GC.

The gene encoding FKBP12 was subcloned from plasmid pJG-FKBP12 topACλcI32. A 384 bp Not I to Asc I fragment was prepared by polymerasechain reaction (PCR) from plasmid pJG-FKBP12 using the primers 5′-ACTTCA GGT GCG GCC GCA GGC TCG GGC GGC TCG GGC GGC GGA GTG CAG GTG GAAAC-3′ (5′ primer VWC611, flanked by a NotI site (underlined) and aGly-Ser-Gly-Gly-Ser-Gly-Gly linker) and 5′-TGT ATC AAC GGC GCG CCT TAATGG TGA TGG TGA TGG TGC GAG CCG AAT TCT TCC AGT TTT AGA A-3′ (3′ primerVWC612, flanked by a AscI site (underlined) and a 6 His tag followed bya stop codon). This fragment was inserted between the NotI site and theAscI site in plasmid pACλcI32 to give pACλcIFKBP12. The region amplifiedby PCR was verified by DNA sequencing using primers VWC626, 5′-CCC AATGAT CCC ATG CAA TG, and VWC627, 5′-GCG CTT CGT TAA TAC AGA TG.

Construction of bacterial strains. (35, 36) Strain V674E was made fromKS1 via a Plvir transduction from SK037 essentially as described.^([14])Incubation times were increased to 30 minutes for infection and 60minutes after addition of citrate in order to accommodate slower growingtimes for the mutated strain. The infected cells are then plated on LBplates containing 12 ^(μ)g/mL tetracycline and 20 mM sodium citrate andincubated at 37° C. overnight. In addition, the positive transductantsare verified by streak purification on an LB plate with 12 ^(μ)g/mLtetracycline, 100 ^(μ)g/mL kanamycin, and 20 mM sodium citrate to givestrain V674E.

Electrocompetent cells of strain V674E were made by standard methods(35); however, the transformation efficiency of this strain is low(10⁶-10⁷) due to the tolC⁻ mutation. Using standard methods, both thepAC^(λ)cI32 and the pBR^(α)LN plasmids or appropriate derivatives weretransformed into the strain. Transformants were grown on LB with 100^(μ)g/mL ampicillin and 6 ^(μ)g/mL chloramphenicol. Note:Chloramphenicol is also a substrate for tolC so levels above 8 ^(μ)g/mLsignificantly impair growth and viability of tolC strains. In additionMacConkey plates can't be used because MacConkey media is toxic tostrains bearing a tolC⁻ mutation.

LacZ assays. (7, 20, 35) The bacterial strains were stored as 20%glycerol stocks in 96-well plates at −80° C. To do the plate assay, thebacterial strains were first phroged to LB liquid media containing 100^(μ)g/mL ampicillin and 6 ^(μ)g/mL chloramphenicol in a 96-well plateand then incubated in a 37° C. shaker overnight. These saturatedcultures were used to innoculate a second 96-well plate containing freshLB liquid media with 100 μg/mL ampicillin, 6 μg/mL chloramphenicol, and0.5 mM IPTG using the phrog. These cultures were grown at 37° C. shakingat 240 rpm for 4-5 hours until the cultures were just beginning to beturbid. These cultures were then phrogged to X-gal plates and X-galplates with varying concentrations of Mtx-SLF and incubated at 37° C.The X-gal plates are LB solid media plates containing 40 μg/mL X-gal,0.5 mM IPTG, 0.5 tPEG (a β-galactosidase inhibitor), 100 μg/mLampicillin and 6 μg/mL chloramphenicol. X-gal hydrolysis was firstvisible at about 8 hours and increased until about 18 hours of growth atwhich point it appeared to reach a maximum.

For the liquid assays, the strains were incubated overnight in LB liquidmedia containing 100 μg/mL ampicillin, 6 μg/mL chloramphenicol, and 0.5mM IPTG overnight. These cultures were used to innoculate 1000 fold intoLB liquid media containing 100 μg/mL ampicillin, 6 μg/mLchloramphenicol, 0.5 mM IPTG, and small molecule (per concentrationindicated in each experiment) and grown to a final OD₆₀₀ of 0.4-0.6 in a37° C. shaker which took about 5 hours. No tPEG is used in the liquidassays in order to get a true quantization of the transcription. Then,the protocol from Miller was followed exactly using 0.1 mL cell lysatefor the assay (35).

Part III—Screening

Using standard β-galactosidase assays on plates (35), we establishedthat Mtx-SLF activates transcription of the lacZ reporter gene in E.coli strain V674E. For the plate assays, mid-log phase liquid culturesof the strains were transferred to a luria broth (LB) plate containingMtx-SLF, 5-bromo-4-chloro-3-indolyl-β-D-galactoside (X-gal),isopropyl-β-D-thiogalactopyranoside (IPTG),phenylethyl-β-D-thiogalactoside (tPEG, a β-galactosidase inhibitor), andthe appropriate antibiotics. The plates were then incubated for 18 hoursat 37° C. The concentrations of Mtx-SLF were varied between 0.1 ^(μ)Mand 10 ^(μ)M (not all shown). On plates containing Mtx-SLF, we observeda substantial increase in the levels of β-galactosidase synthesis overthose due to basal transcription (FIG. 3). Importantly, this increase isdependent on both DHFR and FKBP12 and correlates with the concentrationof Mtx-SLF in the plate media. Also, it is interesting to note that at10 μM Mtx-SLF the levels of small-molecule induced transcription in thethree-hybrid strain appears to be greater than those induced by thedirect protein-protein interaction in the two-hybrid control.

These X-gal plate results were confirmed using more quantitative liquidβ-galactosidase assays. The liquid assays were carried out essentiallyas originally described by Miller (35). Overnight cultures were used toinnoculate fresh LB media containing Mtx-SLF, IPTG, and the appropriateantibiotics. These cultures were grown, lysed, and assayed forβ-galactosidase activity using o-nitrophenyl-β-D-galactopyranoside(ONPG), a chromogenic substrate for β-galactosidase. We once againobserved small-molecule dependent transcriptional activation (FIG. 4).Cells expressing ^(λ)cI-FKBP12 and ^(α)NTD-DHFR showed 6-fold activationat 1 ^(μ)M and 10-fold activation at 10 ^(μ)M Mtx-SLF relative to cellsexpressing only ^(λ)cI and ^(α)NTD. For comparison, the levels oftranscription in cells expressing ^(λ)cI-Gal4 and ^(α)NTD-Gal11^(P) are13-fold those of cells with ^(λ)cI and ^(α)NTD and both are unaffectedby the concentration of Mtx-SLF in the media. As seen in FIG. 5, thelevels of transcriptional activation in the three-hybrid systemcorrelate with the concentration of Mtx-SLF in the media. We begin tosee transcriptional activation at 0.3 ^(μ)m Mtx-SLF, and the levels ofactivation are still increasing at 10 ^(μ)PM Mtx-SLF. At higherconcentrations, Mtx-SLF begins to be toxic to the E. coli cells.Interestingly, at 10 ^(μ)M Mtx-SLF, the levels of transcriptionalactivation in the three-hybrid strain approach those in the two-hybridstrain. We attribute this sensitivity to the picomolar affinity of theMtx/DHFR interaction, although this interpretation has not been proven.Again several independent controls establish that transcriptionalactivation indeed requires both halves of Mtx-SLF (FIG. 4). Neither Mtx,SLF, nor a combination of the two affects the levels of transcription inthe three-hybrid system. At 1 ^(μ)M Mtx-SLF, a 10-fold excess of eitherMtx or the tert-butyl ester of SLF decreased the levels of transcriptionto about half that with 1 ^(μ)M Mtx-SLF alone. Deletion of either DHFRor FKBP1 2 from the ^(λ)cI-FKBP12 and ^(α)NTD-DHFR fusion proteins dropsthe levels of small-molecule induced transcriptional activation to thebackground levels observed with only ^(λ)cI and ^(α)NTD.

DISCUSSION

The bacterial small-molecule three-hybrid system described here providesa robust platform for high-throughput assays based onprotein-small-molecule interact ions. The Mtx-SLF heterodimeric ligandcan be prepared readily and gives a strong transcription readout in theE. coli RNA polymerase three-hybrid system. Notably, the levels oftranscriptional activation with the Mtx-SLF three-hybrid system arecomparable to those with the Gal4-Gal11^(P) interaction, despite thefact that one non-covalent interaction has been replaced with two. Thisresult may speak to the importance of the particularly high affinitybetween Mtx and DHFR.

By adapting bacteria to allow for the use of the Mtx/DHFR interaction,the described three-hybrid system that uses Mtx/DHFR is far superior toprevious bacterial systems of any type. For example, while it is adifferent system and designed for different purpose, the systemdescribed in (9) provides a context in which the described three-hybridsystem based on Mtx/DHFR can be analyzed. The concentrations of smallmolecule in (9) vary between 50 and 350 uM with 250 uM being the firstconcentrations at which activity is conclusive. The described Mtx/DHFRsystem does not need higher than 10 uM and results can be detectedabsolutely at concentrations below 1 uM. In addition, the describedMtx/DHFR system shows a 10-fold increase in signal upon addition ofsmall molecules whereas in (9) shows a signal to noise ratio of at most1.25. In fact, the writers of (9) admitted that screening a librarywould be very difficult with their system, as shown by the fact thatwhen screening a library for activity, they obtain 6 false positives andonly a single true positive hit. In addition, the system described in(9) requires several different fluorescent dyes in order to maintainsignal stability. Thus, the described bacterial three-hybrid systemusing Mtx/DHFR is far superior for screening libraries because of thefar lower concentrations of small molecule requirements due to muchstronger interactions, as well as a vastly improved signal to noiseratio.

Three-hybrid systems provide an in vivo alternative to affinitychromatography that can be used to evolve proteins that recognize aparticular small molecule, to screen a library of small molecules basedon binding to a particular protein, or to screen cDNA libraries to findthe protein targets of drugs or to classify proteins based on theirsmall-molecule interactions. Because of the high transformationefficiency and rapid doubling time of E. coli, this system shouldincrease the number of proteins that can be tested in three-hybridassays by several orders of magnitude compared with yeast systems. Abacterial assay should be particularly advantageous in molecularevolution experiments where on the order of 10⁸ variants may benecessary to alter protein function. Based on our results, we believe Mtx will provide a versatile anchor for presenting a variety of differentsmall molecules.

The yeast three-hybrid assay provided a method for in vivo affinitychromatography that greatly simplifies protein identification andamplification at the end of affinity panning. The bacterial systemdescribed here should increase the number of protein variants that canbe assayed by several orders of magnitude.

BIBLIOGRAPHY

1. S. Fields, O. Song, Nature 1989, 340, 245-246.

2. U.S. Pat. No. 5,928,868, and Licitra, Edward J., et al., PNAS, USA93, 1996, 93, 12817-12821.

3. U.S. Pat. No. 5,468,614, and Yang et al., Nucleic Acid Research 1995,23, 1152-1156.

4. D. Spencer, T. Wandless, S. Schreiber, G. Crabtree, Science 1993,262, 1019-71024.

5. M. Farrar, J. Alberola-Ila, R. Perlmutter, Nature 1996, 383, 178-181.

6. J. Amara, T. Clackson, V. Rivera, T. Guo, T. Keenan, S. Natesan, R.Pollock, W. Yang, N. Courage, D. Holt, M. Gilman, Proc. Natl. Acad. Sci.USA 1997, 94, 10618-10623.

7. H. Lin, W. Abida, R. Sauer, V. Cornish, J. Am. Chem. Soc. 2000, 122,4247-4248.

8. J. Hu, M. Kornacker, A. Hochschild, Methods 2000, 20, 80-94 andreferences within.

9. S. Firestine, F. Salinas, A. Nixon, S. Baker, S. Benkovic, Nat.Biotechnol. 2000, 18, 544-547.

10. S. Kopytek, R. Standaert, J. Dyer, J. Hu, Chem. Biol. 2000, 7,313-321.

11. J. Bolin; Filman, D.; Matthews, D.; Hamlin, R.; Kraut, J. J. Biol.Chem. 1982, 257, 13663-13672.

12. T. Huang; Barclay, B.; Kalman, T.; vonBorstel, R.; Hastings, P. Gene1992, 121,167-171.

13. J. Kralovec; Spencer, G.; Blair, A.; Mammen, M.; Singh, M.; Ghose,T. J. Med. Chem. 1989, 32, 2426-2431.

14. J. March, Advanced Organic Chemistry (1985) pub. John Wiley & SonsInc.

15. H H. House, Modern Synthetic Reactions (1972) pub.Benjamin Cummings.

16. S. Dove, J. Joung, A. Hochschild, Nature 1997, 386, 627-630.

17. S. Michnick, I. Remy, F. Campbell-Valois, A. Vallee-Belisle, J.Pelletier, Methods Enzymol. 2000, 238, 208-230.

18. G. Karimova, J. Pidoux, A. Ullmann, D. Ladant, Proc. Natl. Acad.Sci. USA 1998, 95, 5752-5756.

19. T. Ozawa, S. Nogami, M. Sato, Y. Ohya, Y. Umezawa, Anal. Chem. 2000,72, 5151-5157.

20. S. Dove, F. Huang, A. Hochschild, Proc. Natl. Acad. Sci. USA 2000,97, 13215-13220.

21. S. Dove, A. Hochschild, Genes & Dev. 1998, 12, 745-754.

22. J. Alam and Cook, J. L., Anal. Biochem. 188: 245-254, (1990).

23. N. Rosenthal, Methods Enzymo. 152: 704-720 (1987).

24. A. Shiau and Smith, J. M., Gene 67: 295-299 (1988).

25. S. Benkovic, C. Fierke, A. Naylor, Science 1988, 239, 1105-1110.

26. W. Abida, B. Carter, E. Althoff, H. Lin, V. Cornish, submitted forpublication 2002.

27. T. Keenan, D. R. Yaeger, N. L. Courage, C. T. Rollins, M. E. Pavone,V. M. Rivera, W. Yang, T. Guo, J. F. Amara, T. Clackson, M. Gilman, D.A. Holt, Bioorg. Med. Chem. 1998, 6, 1309-1335.

28. D. Holt, J. Luengo, D. Yamashita, H. Oh, A. Konialan, H. Yen, L.Rozamus, M. Brandt, M. Bossard, M. Levy, D. Eggleston, J. Liang, L.Schultz, T. Stout, J. Clardy, J. Am. Chem. Soc. 1993, 115, 9925-993.8.

29. T. Clackson, Curr. Opin. Chem. Biol. 1997, 1, 210-218.

30. B. Hart, W. Haile, N. Licato, W. Bolanowska, J. McGuire, J. Coward,J. Med. Chem. 1996, 39, 56-65.

31. L. Liu, R. Tanke, M. Miller, J. Org. Chem. 1986, 51, 5332-5337.

32. S. Kopytek, J. Dyer, G. Knapp, J. Hu, Antimicrob. Agents Chemother.2000, 44, 3210-3212.

33. F. Ausubel, R. Brent, R. Kingston, D. Moore, J. Seidman, J. Smith,K. Struhl, Current Protocols in Molecular Biology, John Wiley & Sons,New York, 1995.

34. J. Sambrook, E. Fritsch, T. Maniatis, Molecular Cloning a LaboratoryManual, 2nd ed., Cold Spring Harbor Laboratory Press, Plainview, N.Y.,1989.

35. J. H. Miller, A Short Course in Bacterial Genetics: A LaboratoryManual and Handbook for Escherichia Coli and Related Bacteria, ColdSpring Harbor Laboratory Press, Plainview, N.Y., 1992.

36. P1 Transduction,http://hulab.tamu.edu/Protocalls/Transduction/Plvir%20Transduction.html.37. D. Ladant and G. Karimova, Genetic systems for analyzingprotein-protein interactions in bacteria, Res. Microbiol., 2000,151:711-720.

38. J. Pelletier, F. -X. Campbell-Valois, and S. Michnick,Oligomerization domain-directed reassembly of active dihydrofolatereductase from rationally designed fragments, PNAS USA, October 1998,95:12141-12146.

1. A transgenic bacterial cell comprising (a) a dimeric small moleculewhich comprises a first moiety known to bind a first receptor domaincovalently linked to a second moiety capable of binding a secondreceptor domain, wherein the first and second moieties are different;(b) nucleotide sequences which upon transcription encode i) a firstfusion protein comprising the first receptor domain, and ii) a secondfusion protein comprising the second receptor domain; and (c) a reportergene wherein expression of the reporter gene is conditioned on theproximity of the first fusion protein to the second fusion protein: 2.The bacterial cell of claim 1, wherein the dimeric small molecule hasthe formula:H1-Y-H2 wherein each of H1 and H2 may be the same or different andcapable of binding to a receptor which is the same or different with aIC₅₀ of less than 100 μM; and wherein Y is a linker which may be presentor absent.
 3. The bacterial cell of claim 2, wherein H1 or H2 is capableof binding to a receptor with a IC₅₀ of less than 10 μM.
 4. Thebacterial cell of claim 2, wherein H1 or H2 is capable of binding to areceptor with a IC₅₀ of less than 1 μM.
 5. The bacterial cell of claim2, wherein H1 or H2 is capable of binding to a receptor with a IC₅₀ ofless than 100 nM.
 6. The bacterial cell of claim 2, wherein H1 or H2 iscapable of binding to a receptor with a IC₅₀ of less than 10 nM.
 7. Thebacterial cell of claim 2, wherein H1 or H2 is capable of binding to areceptor with a IC₅₀ of less than 1 nM.
 8. The bacterial cell of claim2, wherein H1 or H2 is a methotrexate moiety, FK506 moiety, an FK506analog, a tetracycline moiety, or a cephem moiety.
 9. The bacterial cellof claim 8, wherein H1 or H2 is a methotrexate moiety.
 10. The bacterialcell of claim 8, wherein H1 or H2 is an FK506 analog.
 11. The bacterialcell of claim 10, wherein the FK506 analog has the structure:


12. The bacterial cell of claim 2, wherein the dimeric small moleculehas the structure:

wherein n is an integer from 1 to
 20. 13. The bacterial cell of claim12, wherein n is an integer from 2 to
 12. 14. The bacterial cell ofclaim 12, wherein n is an integer from 3 to
 9. 15. The bacterial cell ofclaim 12, wherein n is
 8. 16. The bacterial cell of claim 1, wherein thefirst fusion protein further comprises a DNA binding domain, and thesecond fusion protein further comprises a transcription activationdomain.
 17. The bacterial cell of claim 1, wherein the first fusionprotein further comprises a transcription activation domain, and thesecond fusion protein further comprises a DNA binding domain.
 18. Thebacterial cell of claim 16, wherein the transcription activation domainis αNTD.
 19. The bacterial cell of claim 16, wherein the DNA-bindingdomain is λcI, AraC, LexA, Gal4, or zinc fingers.
 20. The bacterial cellof claim 1, wherein the first or the second receptor domain is that ofdihydrofolate reductase (“DHFR”), glucocorticoid receptor, FKBP12, FKBPmutants, tetracycline repressor, or a penicillin binding protein. 21.The bacterial cell of claim 20, wherein the DHFR is the E. coli DHFR(“eDHFR”).
 22. The bacterial cell of claim 1, wherein the first fusionprotein is DHFR-λcI or FKBP12-λcI.
 23. The bacterial cell of claim 1,wherein the second fusion protein is DHFR-αNTD or FKBP12-αNTD.
 24. Thebacterial cell of claim 1, wherein the reporter gene is Lac Z, araBAD,aadA (spectinomycin resistance), his3, β-lactamase, GFP, luciferase,TetR (tetracyclin resistance), KanR (kanamycin resistance), Cm(chloramphenicol resistance).
 25. The bacterial cell of claim 24,wherein the reporter gene is Lac Z.
 26. A transgenic bacterial cellcomprising (a) a dimeric small molecule which comprises a methotrexatemoiety covalently linked to a moiety capable of binding a receptordomain; (b) nucleotide sequences which upon transcription encode i) afirst fusion protein comprising a DHFR domain, and ii) a second fusionprotein comprising the receptor domain; and (c) a reporter gene whereinexpression of the reporter gene is conditioned on the proximity of thefirst fusion protein to the second fusion protein.
 27. The bacterialcell of claim 26 wherein the moiety known to bind a receptor domain iscapable of binding to a receptor with a IC₅₀ of less than 100 nM. 28.The bacterial cell of claim 26, wherein the moiety known to bind areceptor domain is capable of binding to a receptor with a IC₅₀ of lessthan 10 nM.
 29. The bacterial cell of claim 26, wherein the moiety knownto bind a receptor domain is capable of binding to a receptor with aIC₅₀ of less than 1 nM. 30-76. (canceled)
 77. A transgenic bacterialcell comprising (a) a dimeric small molecule which comprises amethotrexate moiety covalently linked to a moiety capable of binding areceptor domain; (b) nucleotide sequences which upon transcriptionencode i) a first fusion protein comprising a DHFR domain and a firstfragment of an enzyme, and ii) a second fusion protein comprising thereceptor domain and a second fragment of the enzyme, wherein activity ofthe enzyme is conditioned on the proximity of the first fragment of theenzyme to the second fragment of the enzyme.
 78. The bacterial cell ofclaim 77, wherein the moiety capable of binding the receptor domain iscapable of binding with a IC₅₀ of less than 100 μM.
 79. The bacterialcell of claim 77, wherein the moiety capable of binding the receptordomain is capable of binding with a IC₅₀ of less than 100 nM.
 80. Thebacterial cell of claim 77, wherein the moiety capable of binding thereceptor domain is capable of binding with a IC₅₀ of less than 1 nM.81-82. (canceled)
 83. The bacterial cell of claim 17, wherein thetranscription activation domain is αNTD.
 84. The bacterial cell of claim17, wherein the DNA-binding domain is λcI, AraC, LexA, Gal4, or zincfingers.